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GENERAL THEORY FOR INTERACTIONS IN SUFFICIENT 
CAUSE MODELS WITH DICHOTOMOUS EXPOSURES 

By Tyler J. VanderWeele 1 and Thomas S. Richardson 2 

Harvard University and University of Washington 

The sufficient-component cause framework assumes the existence 
of sets of sufficient causes that bring about an event. For a binary out- 
come and an arbitrary number of binary causes any set of potential 
outcomes can be replicated by positing a set of sufficient causes; typi- 
cally this representation is not unique. A sufficient cause interaction is 
said to be present if within all representations there exists a sufficient 
cause in which two or more particular causes are all present. A singu- 
lar interaction is said to be present if for some subset of individuals 
there is a unique minimal sufficient cause. Empirical and counterfac- 
tual conditions are given for sufficient cause interactions and singular 
interactions between an arbitrary number of causes. Conditions are 
given for cases in which none, some or all of a given set of causes 
affect the outcome monotonically. The relations between these re- 
sults, interactions in linear statistical models and Pearl's probability 
of causation are discussed. 

1. Introduction. Rothman's sufficient-component cause model [25] pos- 
tulates a set of different causal mechanisms, each sufficient to bring about the 
outcome under consideration. Rothman refers to these hypothesized causal 
mechanisms as "sufficient causes," conceiving of them as minimal sets of 
actions, events or states of nature which together initiate a process resulting 
in the outcome. 

Thus each sufficient cause is hypothesized to consist of a set of "compo- 
nent causes." Whenever all components of a particular sufficient cause are 
present, the outcome occurs; within every sufficient cause, each component 
would be necessary for that sufficient cause to lead to the outcome. Models 
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of this kind have a long history: a simple version is considered by Cayley [4]; 
it also corresponds to the INUS model introduced by Mackie [14] in the 
philosophical literature; see also [3] for an early application. Much recent 
work has sought to relate the model to other causal modeling frameworks 
[9, 11, 35, 37, 39]. 

In traditional sufficient-component cause [SCC] models, the outcome and 
all the component causes are events, or equivalently, binary random vari- 
ables. An SCC model with k component causes implies a set of 2 k potential 
outcomes. Conversely, in Section 2 we show that for any given list of po- 
tential outcomes there is at least one SCC model which represents this set. 
However, in general there may be many such SCC models. 

One question concerns whether, given a set of potential outcomes implied 
by some (unknown) SCC model, one may infer that two component causes 
are present within some sufficient cause in the unknown SCC model. In gen- 
eral, it is possible that two SCC models both imply the same set of potential 
outcomes, yet although A and B occur together in some sufficient component 
cause in the first model, A and B are not present together in any sufficient 
component cause in the second. In [39] two sufficient component causes are 
said to form a "sufficient cause interaction" (or to be "irreducible" ) if they 
are both present within at least one sufficient cause in every SCC model for 
a given set of potential outcomes. Of course, in general, the distribution of 
potential outcomes for a given population is also unknown, though it is con- 
strained (marginally) by the observed data from a randomized experiment. 
In [39] empirical conditions are given which are sufficient to ensure that for 
any set of potential outcomes compatible with experimental data, all com- 
patible SCC models will contain a sufficient cause involving A and B. These 
results were an improvement upon earlier empirical tests for the existence of 
a two-way interaction in an SCC model [26], which required the assumption 
of monotonicity; see also [1, 12, 13, 17, 38]. The new results are able to es- 
tablish the existence of an interaction in situations where monotonicity does 
not hold. In this paper we develop empirical conditions that are sufficient for 
the existence of a sufficient cause containing a given subset of an arbitrary 
number of variables, both with and without monotonicity assumptions. 

As illustrative motivation for the theoretical development, we will consider 
data presented in a study by [31], summarized in Table 1, from a case- 
control study of bladder cancer examining possible three-way interaction 
between smoking (1 = present), and genetic variants on NAT2 (0 = R, 1 = 5* 
genotype) and NAT1 (1 for the *10 allele) for Caucasian individuals. We 
return to this example at the end of this paper to examine the evidence for 
a sufficient cause containing all three: smoking, the S genotype on NAT2 
and the *10 allele on NATL 

The remainder of this paper is organized as follows: Section 2 presents 
the sufficient-component cause framework as formalized by VanderWeele 
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Table 1 

Case-control data from a study of bladder cancer [31] 



Smoking 


NAT2 


NAT1*10 


Cases 
(n = 215) 


Controls 
(n = 191) 


Odds ratio (95% CI) 











6 


13 


1 








1 


8 


16 


1.1 (0.3, 3.9) 





1 





16 


31 


1.1 (0.4, 3.5) 





1 


1 


6 


10 


1.3 (0.3, 5.3) 


1 








42 


32 


2.8 (1.1, 8.3) 


1 





1 


41 


26 


3.4 (1.2, 10.1) 


1 


1 





61 


51 


2.6 (0.9, 7.3) 


1 


1 


1 


35 


12 


6.3 (2.0, 20.3) 



and Robins [39]. Section 3 describes general n-way irreducible interactions 
(aka "sufficient cause interactions") and characterizes these in terms of po- 
tential outcomes. Section 4 derives empirical conditions for the existence of 
irreducible interactions both with and without monotonicity assumptions. 
Section 5 describes "singular" interactions which arise in genetic contexts, 
provides a characterization, derives empirical conditions that are sufficient 
for their existence and relates this notion to Pearl's probability of causa- 
tion. Section 6 discusses the relation between singular and sufficient cause 
interactions and linear statistical models. Section 7 provides some comments 
regarding stronger interpretations of sufficient cause models, and returns to 
the data presented in Table 1. Finally Section 8 offers some possible exten- 
sions to the present work. 

2. Notation and basic concepts. We will use the following notation: An 
event is a binary random variable taking values in {0, 1}. We use uppercase 
roman to indicate events (X), boldface to indicate sets of events (C), and 
lowercase to indicate specific values both for single random variables {X = 
x), and, with slight abuse of notation, for sets {C = c} = {Vi, (C)j = (c)j} 
and {a < b} = {Vz,(a)j < (b)j}; 1 and are vectors of l's and 0's; the 
cardinality of a set is denoted |C|. We use fraktur (53) to denote collections 
of sets of events. 

The complement of some event X is denoted by X = 1 — X . A literal event 
associated with X, is either X or X. For a given set of events C, L(C) is 
the associated set of literal events 

L(C) = Cu{X|XeC}. 

For a literal L € L(C), and an assignment c to C, (L) c denotes the value as- 
signed to L by c. The conjunction of a set of literal events B = {Fi , . . . , F m } C 
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L(C) is defined as 

m 

f\(B) = Y[F i = mm{F 1 ,...,F m }; 

i=l 

note that = 1 if and only if for all i, Fi = 1. We also define B\ A 

Bi = /\{i?i,i?2}- We will use 1(^4) to denote the indicator function for event 
A. There is a simple correspondence between conjunctions of literals and 
indicator functions: let B = {X±, . . . , X s } and C = {Yi, . . . , Yf}, then 

(2.1) /\({X 1 ,...,X s ,Y 1 ,...,Y t }) = l & I({B = 1,C = 0}) = 1. 

Similarly, the set of literals corresponding to an assignment c to C is defined 

B[ c l = {L|LeL(C),(L) c = l} 

so that A( b[c] ) =I(C = c); note that |BW| = |C|. The disjunction of a set 
of binary random variables is defined as 

V ( { Zi , . . . , Z p } ) = max{Zi , . . . , Z p }; 

note that \f({Z±, . . . , Z p }) = 1 if and only if for some j, Zj = 1. Simi- 
larly we let B\ V F>2 = \J{Bi,B2}- Given a collection of sets of literals 
33 = {Ci, . . . , C q }, we define 

VA( ,S )-V(A(^))- 

i 

We use P(L(C)) to denote the set of subsets of L(C) that do not contain 
both X and X for any X G C; more formally, 

P(L(C)) = {B|B C L(C) for all X G C, {X,X} £ B}. 

Note tha^if B G P(L(C)), and |B| = |C|, so that for all C G C, exactly one 
of C or C is in B, then an assignment of values b to B induces a unique 
assignment c to C and vice versa. 

2.1. Potential outcomes models. Consider a potential outcome model [27, 
28, 30] with s binary factors, X±, . . . ,X S , which represent hypothetical in- 
terventions or causes, and let D denote some binary outcome of interest. 
We use VL to denote the sample space of individuals in the population and 
use oj for a particular sample point. Let D xl ^^ tXg (uj) denote the counterfac- 
tual value of D for individual uj if the cause Xj were set to the value Xj 
for j = 1, ...,s. The potential outcomes framework we employ makes two 
assumptions: first, that for a given individual these counterfactual variables 
are deterministic; second, in asserting that the counterfactual D x1 ^„ jXs (oj) is 
well defined, it is implicitly assumed that the value that D would take on for 
individual uj is determined solely by the values that X±, . . . ,X S are assigned 
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Table 2 

All potential outcomes and actual outcomes for three binary causes, X\, X% and Xz, in 

a population with two individuals 



Individual 


-Dooo -D001 


-D010 Don Z?ioo Dial Duo "111 


(Xi, X2, X3) 


D 


1 


1 


10 110 


(1,0,1) 


1 


2 


1 


10 111 


(0,0,0) 






for this individual, and not the assignments made to these variables for other 
individuals uj' . This latter assumption is often called "no interference" [7], 
or the stable unit treatment value assumption (SUTVA) [29]. An example 
of a situation where this assumption might fail is a vaccine trial where there 
is "herd" immunity. 

We will use D xl> _ >Xa (uj), Dx 1 = xi ,...,x 3 =x 3 (^), D c and D c=c (uj), with C = 
{X\, . . . ,X S } interchangeably. In this setting there will be 2 s potential out- 
comes for each individual uj in the population, one potential outcome for each 
possible value of (X\, . . . ,X S ); we use V(C;uj) to denote the set of all such 
potential outcomes for an individual, and P(C;0) for the population. Note 
that if G = g(C) is some deterministic function of C, then (?c= c (w) = <?(c), 
and hence is constant; thus our usage is consistent with the definition of 
(L) c in the previous section. 

The actual observed value of D for individual uj will be denoted by D(uj) 
and similarly the actual value of X\ , . . . , X s by X\ (to), ... , X s (co) . Actual and 
counterfactual outcomes are linked by the consistency axiom which requires 
that 

(2-2) D Xl=Xl(u)r „ tXt=Xt(u) (u) = D(u), 

that is, that the value of D which would have been observed if X\ , . . . , X s had 
been set to the values they actually took is equal to the value of D which was 
in fact observed [22]. It follows from this axiom that -Dxi(w),...,x s (o;)( w ) = D 
is observed, but it is the only potential outcome for individual uj that is 
observed. 

Example 1. Consider a binary outcome D with three binary causes 
of interest, Xi, X2 and A3. Suppose that the population consists of two 
individuals. The potential outcomes (left-hand side) and actual outcomes 
(right-hand side) are shown in Table 2. 

We use the notation A _LL B\C to indicate that A is independent of B, 
conditional on C in the population distribution. 

2.2. Definitions for sufficient cause models. The following definitions 
generalize those in [39] to sub-populations, 7^ C ft: 
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Definition 2.1 (Sufficient cause). A subset B of the putative (binary) 
causes L(C) for D forms a sufficient cause for D (relative to C) in sub- 
population £1* if for all c € {0, 1}I C I such that (/\(B)) C = 1, D c {uj) = 1 for 
all w£Sl*cn, [We require that there exists a c* such that (/\(B)) C * = 1.] 

Observe that if B is a sufficient cause for D, then any intervention setting 
the variables C to c with (/\(B)) C = 1 will ensure that D c (u) = 1 for all 
lo £ Q*. We restrict the definition to nonempty sets f2*, to preclude every set 
B being a sufficient cause in an empty sub-population. Likewise we require 
that there exists some c* such that (/\(B)) C * = 1 in order to avoid logically 
inconsistent conjunctions, for example, X\ AX\, being classified (vacuously) 
as a sufficient cause. As a direct consequence, for any binary random variable 
X, at most one of X and X appear in any sufficient cause B. 

Proposition 2.2. In ft* if B is a sufficient cause for D relative to C, 
then B is sufficient for D in any set C* with B C C* C C. 

B may be sufficient for D relative to C in Q* , but not relative to C'dC. 

Proposition 2.3. If B is a sufficient cause for D relative to C in ST, 
then B is sufficient for D relative to C in any subset ^ f2** C Q* . 

B may be sufficient for D relative to C in Q*, but not in ft'Dfi*. 

Definition 2.4 (Minimal sufficient cause). A set B C L(C) forms a 
minimal sufficient cause for D (relative to C) in sub-population Q* if B 
constitutes a sufficient cause for D in f2*, but no proper subset B* C B also 
forms a sufficient cause for D in Q*. 

Note that (in some 0*) B may be a minimal sufficient cause for D relative 
to C, but not relative to C* C C, so the analog of Proposition 2.2 does not 
hold. For individual 2 in Table 2 {Xi,A3} is a minimal sufficient cause 
relative to {X\, X2, A3}. However, if we suppose that for oj = 2, X2 is not 
caused by X\ and A3, so for all xi, X3, X2x±=x 1 ,x 3 =x a {u = 2) = X2{uj = 2), 
then {Ai, A3} is not a minimal sufficient cause relative to {Ai, A3}. 

Dx r =o,x 3 =i{^ = 2) = D Xl =o,x 2 =o,x 3 =i(u = 2) = 1, 

D Xl =i,x 3 =i(u = 2) = D Xl =i,x 2 =o,X3=i(u = 2) = 1 

[since A2(w = 2) = 0]; hence A3 is a sufficient cause of D relative to {Ai, A3}; 
hence {Ai, A3} is not minimal relative to {Ai, A3} for u = 2. 

Similarly, if B is a minimal sufficient cause for D relative to C in fi*, 
it does not follow that B is a minimal sufficient cause for D relative to 
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C in subsets f2** C fi*, so the analog to Proposition 2.3 does not hold. In 
particular, it may be the case that for all cj € f2*, B is not a minimal sufficient 
cause for D in {oj}. 

In the language of digital circuit theory [15], sufficient causes are termed 
"implicants," and minimal sufficient causes are "prime implicants." 

Definition 2.5 (Determinative set of sufficient causes). A set of suffi- 
cient causes for D, 23 = {Bi, . . . ,B n } C P(L(C)), is said to be determinative 
for D (relative to C) in sub-population £1* if for all and for all c, 

D c (uj) = 1 if and only if (V A(®))c = 1- 

We will refer to a determinative set of sufficient causes for D as a sufficient 
cause model. Observe that in any sub-population 0* for which there exists a 
determinative set of sufficient causes, the vectors of potential outcomes for 
D are identical, so V(C,uj) = P(C,u/) for all co,u)' € 0*. 

Definition 2.6 (Nonredundant set of sufficient causes). A determina- 
tive set of sufficient causes 23, for D, is said to be nonredundant (in Q* , 
relative to C) if there is no proper subset 23* C 23 that is also determinative 
for D. 

Note that sufficient causes are conjunctions, while sets of sufficient causes 
form disjunctions of conjunctions; minimality refers to the components in a 
particular conjunction, that each component is required for the conjunction 
to be sufficient for D; nonredundancy implies that each conjunction is re- 
quired for the disjunction of the set of conjunctions to be determinative. If 
for some set of sufficient causes 23 C P(L(C)), for all X 6 C, and all B G 23, 
either IgB or IgB, then 23 is a nonredundant set of sufficient causes. 

Example 1 (Revisited). The set 23 x = {{X 1 , X 2 }, {X 2 , X 3 }, {X 2 , X 3 }} 
forms a determinative set of sufficient causes for the individual co = 2, since 

(2.3) D c (oj = 2) = ((Xx A X 2 ) V {X 2 A X 3 ) V (X 2 A X 3 )) c 
as does ^ 2 = {{X 1 ,X 3 },{X 2 ,X 3 },(X 2 ,X 3 }}: 

(2.4) D c (lj = 2) = ((Xi A X 3 ) V (X 2 A X 3 ) V (X 2 A X 3 )) c . 

As this example shows, determinative sets of sufficient causes are not, in 
general, unique. 

2.3. Sufficient cause representations for a population. As noted, if B is 
a sufficient cause for D in Q* , then all the units in 0* will have D = 1 for 
any assignment c to C, such that (/\(B)) C = 1. In most realistic settings 
it is unlikely that any set B will be sufficient to ensure D = 1 in an entire 
population. Consequently, different sets of sufficient causes will be required 
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within different sub-populations. A sufficient cause representation is a set of 
sub-populations, each with its own determinative sufficient cause represen- 
tation. 

Definition 2.7. A sufficient cause representation (A, 53) for V(C; fl) is 
an ordered set A = (A±, . . . , A p ) of binary random variables, with (Ai) c = A{ 
for all i,c, and a set 55 = (Bi, . . . , B p ), with Bj G P(L(C)), such that for all 
oj, c, D c (uj) = 1 for some j,Aj(oj) = 1 and (A(Bj)) c = 1. 

Note that the binary random variables Ai and the sets Bj are naturally 
paired via the orderings of A and 53; we will refer to a pair (Aj,Bj) as 
occurring in the representation. The requirement that (Aj) c = Ai for all i,c 
implies that A n C = 0, and further that the Ai are unaffected by inter- 
ventions on the Xi] this is in keeping with the interpretation of the Ai as 
defining pre-existing sub-populations with particular sets of potential out- 
comes for D. 

Proposition 2.8. If (A, 53) is a sufficient cause representation for 
T>(C;Q), then Bj is a sufficient cause of D in the sub-population in which 
Ai(oj) = l. 

Proposition 2.9. If (A, 23) is a sufficient cause representation for 
V(C;n), then for all A* C A, if 

^1\a* =H f° r allAi£A,Ai(u) = l iffAiEA*}, 

then 

53 A * = {Bj|Bj G 53; A4 G A*} 

forms a determinative set of sufficient causes (relative to C) for ^ A \ A »- 

Note that ^ A \A* cons ists of the sub-population in which Ai(oj) = 1 for 
all Ai G A* and Aj(uS) = for all Aj G A \ A*. 

Proof. Suppose for some oj G ^^\A*' ^3 e 25 A *\ an d c we have 
(A(Bj)) c = 1. Since oj G ^^\a*' A?^) = 1- ^ then follows from the defi- 
nition of a sufficient cause representation that D c (oj) = 1. Conversely, sup- 
pose D c (oj) = 1. As (A, 53) is a sufficient cause representation, for some j, 
Aj(u) = 1 and (/\(Bj)) c = 1. Since, by hypothesis, oj G ^ a \ a , , it follows 

that Aj G A*, hence B^ G 53 A * . □ 

Theorem 2.10. For any T>(C;Q), there exists a sufficient cause repre- 
sentation (A, 53). 

Proof. Let p = 2l c l, and define 53 = {B|B C P(L(C)), |B| = |C|} = 
(Bi,...,B p ), ordered arbitrarily. Further define Ai(oj) =D^ i= i(oj). Given 
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an arbitrary c, for some j, = Bj, by construction of 03. We then have 
D e (u) = l & D Bj=1 (u) = l & Aj(u) = l and (/\(Bj)) = 1 

as required. The last step follows since by definition B^ = 1 if and only if 
C = c. □ 

[39] prove this for the case of |C| = 2; see also [9] and [35] for discussion 
of the case |C| = 1. 

Example 1 (Revisited). The construction given in the proof of The- 
orem 2.10 would yield the following sets of sufficient causes to represent 
X>(C;fi) shown in Table 2: 

<B = (Bi,...,B 8 ) 

(2.5) = ({X 1 ,X 2 ,X 3 },{X 1 ,X 2 ,X 3 },{X 1 ,X 2 ,X 3 },{X 1 ,X 2 ,X 3 }, 

{Xi,X 2 ,X 3 }, {Xi,X 2 ,X 3 }, {X\,X 2 ,X 3 }, {Xi,X 2 , X 3 }} 
with At = I({tjj = 2}),A 4 = A 5 = A 8 = 0, A 2 = A 3 = A 6 = A 7 = 1. 

3. Irreducible conjunctions. We saw in Example 1 above with uj = 2 
that an individual's potential outcomes may be such that there are two 
determinative sets of common causes 03 and 23' and {X\ , X 2 } is in 03, but not 
in 03'. However, certain conjunctions are such that in every representation 
either the conjunction is present or it is contained in some larger conjunction; 
such conjunctions are said to be "irreducible." 

Definition 3.1. Be P(L(C)) is said to be irreducible for V(C, O) if in 
every representation (A, 03) for X>(C,0), there exists Bj £ 03, with B C B,;. 

[39] also refer to irreducibility of B for T>(C,Q) as a "sufficient cause 
interaction" between the components of B. (Note, however, that if B is irre- 
ducible, this does not, in general, imply that B is either a minimal sufficient 
cause, or even a sufficient cause, only that there is a sufficient cause that 
contains B.) It can be shown (via Theorem 3.2 below) that {X 2 ,X 3 } and 
{X 2 ,X 3 } are irreducible for X>(C;0) in Table 2. In Section 7 we provide 
an interpretation of irreducibility in terms of the existence of a mechanism 
involving the variables in B. Using Ci U C 2 to indicate the disjoint union of 
Ci and C2, we now characterize irreducibility: 

Theorem 3.2. Let C = CiUC 2; B e P(L(Ci)), |B| = [Ci|. 
Then B is irreducible for 2?(C,f2) if and only if there exists w*e!] and 
values for C 2 such that: (i) D B=1C2=c »(w*) = 1; (ii) for all LgB, 
^B\{L}=l,L=0,C 2 =c* (w*) = 0. 
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Thus B is irreducible if and only if there exists an individual in who 
would have response D = 1 if every literal in B is set to 1, but D = 
whenever one literal is set to and the rest to 1 (in some context C2 = cJj). 
Note that conditions (i) and (ii) are equivalent to 

(3.1) £>B=1,C 2 =C*(W*) - D B\{L}=l,L=0,C 2 =c*(u*) > 0. 

LeB 

Proof. (=>•) We adapt the proof of Theorem 2.10 to show that if for all 
oj G £2 and assignments to C2, at least one of (i) or (ii) does not hold, then 
there exists a representation (A, 05) for T>(C,Q) such that for all Bj G 55, 
B ^ Bj. Define 

25* = (Bj) = {B*|B* G P(L(C)), |B*| = |C|,B £ B*}, 

®* = (B|) = {B*|B* eP(L(C)),|B*| = |C| - 1,B\B* = {L},LgB}, 

under arbitrary orderings. Thus 25* is the set of subsets of exactly |C| literals 
that do not include B as a subset, while 55* contains those subsets of size 
|C| — 1 that contain all but one literals in B. 

For Bj G 25* define the corresponding A\(uj) = D B t =1 (o;); 

i 

For B] G 25* define A\{oj) =^ B t =u . =0 (w)f B t =li . =1 (w), where 
{Li} = B\B{. 

The representation is given by (A, 25) = (A* U A*, 25* U 25*), where A* = 
(An, A* = (A\). To see this, first note that if for some u and c, there 
is a pair (Aj,Bj) in (A, 25) such that Aj(u) = 1 and (/\(Bj)) c = 1. Then 
by construction of A* and A* it follows that D c (uj) = 1. For the converse, 
suppose that for some c and uj, D c (lo) = 1. There are two cases to consider: 

(A(B)) C = 0. In this case B ^ B [c l , so for some j, B* = B [c l , hence A-(cj) = 
D B t =1 (uj) = D c (oj) = 1, as required. 

(/\(B)) C = 1. Let c be partitioned as (ci,C2). Since (i) holds with = C2, 
(ii) does not. Thus for some L G B, ^B\{£}=l,£=o,C 2 =e 2 ( w ) = !• By construc- 
tion of 25*, for some j, B* = B^ \ {L}, so (A(B|)) C = 1. Since 1 = D c (u) = 
^b\{l}=i,l=i,C2=c 2 ( w ) 5 we have A ){u) = 1, as required. 

(•<=) Suppose for a contradiction, that for some u* and c|, (i) and (ii) 
hold, but B is not irreducible. Then there exists a representation (A, 25) 
such that for all B, G 25, B ^ Bj. By (i), -Db=1,c| = 1- Thus for some 
pair (Aj,Hj), Aj(uj*) = 1 and Bj C BUB^l . Since B ^ Bj there exists some 
L G B \ Bj, but then since Aj(u*) = 1 and (A( B j))B\{L}=i,L=o,c 2 =c* = 1, 
we have i^B\{L}=i,i=o,C 2 =c| = 1> which is a contradiction. □ 
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Corollary 3.3. IfB is irreducible for V(C,Q), then for any ST D Q,, 
B is irreducible for T>{C,Q,*) . 

Proof. By Theorem 3.2, since if £1 satisfies (i) and (ii), then so does fi*. 

□ 

3.1. B irreducible for D(C,£l) with |B| = |C|. In the special case where 
|B| = |C|, the concepts of minimal sufficient cause for some ui* and irre- 
ducibility coincide. 

Proposition 3.4. IfB eP(L(C)) and |B| = |C|, then B is a minimal 
sufficient cause for some co* € relative to C if and only if B is irreducible 
forV(C,tt). 

Proof. If |B| = |C|, then condition (i) in Theorem 3.2 (taking C2 = 0) 
holds if and only if B is a sufficient cause for D for 00*, and similarly condition 
(ii) holds if and only if B is a minimal sufficient cause for D (for uj*). □ 

Thus we have the following: 

Corollary 3.5. IfB eP(L(C)), |B| = |C| and B is a minimal suf- 
ficient cause for D for some uj* G £1, then B £ 23 for every representation 
(A, 58) for V(C,ft). 

Proof. Immediate from Proposition 3.4. □ 

3.2. B irreducible for V(C, O) with |B| < |C|. When |B| < |C|, the con- 
ditions for irreducibility and for being a minimal sufficient cause are logically 
distinct. Condition (i) in Theorem 3.2 requires -Db=i,c 2 =c^ (w*) = 1 for one 
assignment c*. (and some uj*), while if B is a sufficient cause (for uj*), then 
this condition is required to hold for all assignments ; in contrast condition 
(ii) in Theorem 3.2 requires that there exists a single (and some w*) such 
that for all L G B, ^b\{l}=i,l=o,c 2 =c*( w *) = 0, while for B to be a minimal 
sufficient cause for u* merely requires that for all L € B, there exists an 
assignment c% such that -Db\{l}=i l=o c 2 =c^( uj *) = 0- 

Example 1 (Revisited). Let C = {X 1 ,X 2 ,X 3 }, n = {2}. Relative to C, 
{X\, X2} is a minimal sufficient cause for uj = 2 since -Dm (2) = Dhq(2) = 1, 
and Don (2) = D\oq(2) = 0. However {X\,X2} is not irreducible for £>(C,f2) 
because we have Z?ioi(2) = Doio(2) = 1, hence condition (ii) in Theorem 3.2 
is not satisfied for either X3 = 0, or X3 = 1. Conversely {X\} is irreducible 
for = {2} since Dm (2) = 1, while Don (2) = 0, but {X\} is not a sufficient 
cause because Diqo(2) = 0. 
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Though irreducibility of B for !D(C,f2) neither implies, nor is implied by 
B being a minimal sufficient cause for some u € 0, it does imply that every 
sufficient cause representation for T>(C,Q) contains at least one conjunction 
Bj of which B is a (possibly proper) subset. However, prima facie this still 
leaves open the possibility that, for example, every representation either 
includes B U {L} or B U {L}, for some L, but no representation includes 
both. However, this cannot occur: 

Corollary 3.6. Let C = CiOC 2 , Be P(L(Ci)), if B is irreducible 
for P(C,0) then there exists a set B* £ P(L(C)), with |B*| = |C| such that 
in every representation (A, 05) for £>(C,f2) there exists Bj £ 05, with B C 
BjCB*. 

Thus irreducibility of B further implies that there is a set B* of size |C| 
such that in every representation there is at least one conjunct containing B 
that is itself contained in B*. However, it should be noted that, in general, 
there may be more than one conjunct Bj with BCBjCB*. 

PROOF. Immediate from Theorem 3.2, taking B* = B U . □ 

Finally, we note that a conjunction that is both irreducible and a mini- 
mal sufficient cause corresponds to an "essential prime implicant" in digital 
circuit theory [15]. The Quine-McCluskey algorithm [16, 20, 21] finds the 
set of essential prime implicants for a given Boolean function, which here 
corresponds to the potential outcomes T>(C,lj) for an individual. 

3.3. Enlarging the set of potential causes. As noted in Section 2.2 a set B 
may be a minimal sufficient cause for C but not a superset C. Irreducibility 
is also not preserved without further conditions. To state these conditions 
that preserve irreducibility we require the following: 

Definition 3.7. X' is said to be not causally influenced by a set C if 
for all u) € 0, the potential outcomes X' c=c (uj) are constant as c varies. 

We will also assume that if every X' € C is not causally influenced by C, 
then the following relativized consistency axiom holds: 

(3-2) 5c=c,c=c'( w )(w) = fc=c(w), 

that is, that if variables in C' are not causally influenced by the variables in 
C, then the counterfactual value of D intervening to set C to c is the same 
as the counterfactual value of D intervening to set C to c and the variables 
in C to the values they actually took on. 

We now have the following corollary to Theorem 3.2: 
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Corollary 3.8. Let C = CiUC 2 , BeP(L(Ci)), |B| = |Ci|. IfB is 
irreducible for P(C,0), C H C = and /or aZZ X' E C', X' is not causally 
influenced by C, then B is irreducible for T>(C U C,Q). 

Proof. By Theorem 3.2 there exists oj* € ft and an assignment to 
C2 such that (i) and (ii) hold. Let c' = C'(u*). Since variables in C are not 
causally influenced by C, for all assignments b, 

- D B=b,C 2 =c^C'=c'( a; *) = - D B=b,C 2 =c*,C'=C'(w*)( w *) = -DB=b,C 2 =c*(k/); 

the second equality here follows from (3.2). It follows that uj* and (c^c') 
obey (i) and (ii) in Theorem 3.2 with respect to C U C. □ 

The assumption that every variable in C' is not causally influenced by C, 
is required because otherwise we may have C'(uj*) 7^ (C')B=b* ( w *) f° r some 
assignment b* to B. For example, let C = {X\, X2, X3}, and suppose that 

Dx 1 =x 1 ,x 2 =x 2 ,x 3 =x 3 {u) = £3, 

{Xz)x 1 =x 1 ,X 2 =x 2 W) = x l A x 2 

for all u> G Q. In this case {X±,X2} is irreducible for T>({Xi, X2}, O), but 
not for D({Xi,X2,Xs},Q). We saw earlier that if B is a minimal sufficient 
cause for C, then this does not imply that B is a minimal sufficient cause 
with respect to subsets of C . Here we see that if B is irreducible with respect 
for V(C,Q), then this does not imply irreducibility for supersets C* D C, 
unless every variable in C* \ C is not causally influenced by a variable in C. 

4. Tests for irreducibility. In this section we derive empirical conditions 
which imply that a given conjunction B is irreducible for P(C, Q). Our first 
approach is via condition (3.1). 

4.1. Adjusting for measured confounders. To detect that (3.1) holds re- 
quires us to identify the mean of potential outcomes in certain subpopula- 
tions. This is only possible if we have no unmeasured confounders [22, 24]: 

Definition 4.1. A set of covariates W suffices to adjust for confounding 
of (the effect of) C on D if 

(4.1) D c =c-LLC|W = w 

for all c, w. 

Proposition 4.2. If a set W suffices to adjust for confounding of C 
on D and P(C = c, W = w) > 0, then 

S[£)c=c|W = w]=£7[D|C = c,W = w]. 
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The proof of this is standard and hence omitted. 

Note that if W is sufficient to adjust for confounding of C on D, then W 
is also sufficient to adjust for confounding of B on D, where B € P(L(C)), 
|B| = |C|. 

4.2. Tests for irreducibility without monotonicity. 

Theorem 4.3. Let C = CiUC 2; B e P(L(Ci)), |B| = |Ci|. If W is 
sufficient to adjust for confounding of C on D, and for some c 2 , w, 

< E[D\B = 1, C 2 = c 2 , W = wl, 

(4.2) 

- ^£[L>|B\{L} = l,L = 0,C 2 = c 2 ,W = w], 

LeB 

then B is irreducible for 2?(C,fi). 

Proof. We prove the contrapositive. Suppose that B is not irreducible 
for D(C,f2). Then by Theorem 3.2, for all oj € fl, and all c 2 , 

£>B=l,C 2 =c 2 M ~ ^ A}\{L}=l,L=0,C 2 =c 2 H < 0. 
Hence for any w, 



LeB 



E 



-Db=1,C 2 =c 2 — ^B\{L}=l,L=0,C 2 =c 2 



LeB 



W = w 



<0. 



Applying Proposition 4.2 to each term implies the negation of (4.2). □ 

The condition provided in Theorem 4.3 can be empirically tested with t- 
test type statistics if W consists of a small number of categorical or binary 
variables or using regression or inverse probability of treatment weighting 
methods [23, 41-43], otherwise. 

It follows from Corollary 3.8 that condition (4.2) further establishes that 
B is irreducible for D(C U C, CI) so long as every variable in C' is not 
causally influenced by variables in C. 

It may be shown that condition (4.2) is the sole restriction on the law of 
(D, C, W) implied by the negation of irreducibility. 

4.3. Graphs. In the next section we develop more powerful tests under 
monotonicity assumptions. However, to state these conditions we first intro- 
duce some concepts from graph theory: 

Definition 4.4. A graph (3 defined on a set B is a collection of pairs 
of elements in B, & = {E|E = {Bi,B 2 } CB,5i/ B 2 }. 



SUFFICENT CAUSE INTERACTIONS 



15 



This is the usual definition of a graph, except that the vertex set here is 
a set of literals. We will refer to sets in & as edges, which we will represent 
pictorially as B\ — Bi- 

Definition 4.5. Two elements L,L* € B are said to be connected in & 
if there exists a sequence L = L\, . . . , L p = L* of distinct elements in B such 
that {Li, Lj+i} G (5 for i = 1, . . . ,p — 1. 

The sequence of edges joining L and L* is said to form a path in ©. 

Definition 4.6. A graph © on B is said to form a tree if \<S\ = |B| — 1, 
and any pair of distinct elements in B are connected in (S. 

In a tree there is a unique path between any two elements. 

Proposition 4.7. Let 1 be a tree on B. For each element ReB there 
is a natural bijection 

4:B\{fi}HT 

given by (j>\{L) = E = {V , L} where EgT is the last edge on a path from R 
to L. 

It is not hard to show that for a graph (25, if the bijections described in 
Proposition 4.7 exist, then <& is a tree. 

Theorem 4.8 (Cayley [5]). On a set B there are |B|I B I~ 2 different trees. 

AA. Monotonicity. Sometimes it may be known that a certain cause has 
an effect on an outcome that is either always positive or always negative. 

Definition 4.9. Bi has a positive monotonic effect on D relative to a 
set B (with Bi E B) in a population 0, if for all u G ft and all values b_j for 
the variables in B \ {Bi}, -D B \{B I }=b_ l ,B l =i(w) > D B \ {Bi}=b _ i)Bi=0 (u). 

Similarly we say that L has a negative monotonic effect relative to BU{L} 
if L has a positive monotonic effect relative to B U {L}. Note that the case 
in which D B \ {Bi}=h _ iyBi=l (uj) = D B \ {Bi}=h _ iiBi=0 (uj) for all u, and hence 
Bi has no effect on D relative to B, is included as a degenerate case. 

The definition of a positive monotonic effect requires that an intervention 
does not decrease D for every individual, not simply on average, regardless 
of the other interventions taken. This is thus a strong assumption; see [40] 
for further discussion. 

Monotonic Boolean functions have been studied in other contexts: 
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Proposition 4.10. If for all Ci e C, Ci has a (positive or negative) 
monotonia effect on D relative to C, and k = \C\, then the number of distinct 
sets of potential outcomes in X>(C,0) is given by the kth Dedekind number 
(Dedekind [8], Wiedemann [44])- 

4.5. Tests for irreducibility with monotonicity. Knowledge of the mono- 
tonicity of certain potential causes allows for the construction of more pow- 
erful statistical tests for irreducibility than those given by Theorem 4.3. 

Theorem 4.11. Let C = CiUC 2 , B = (B+UB') € P(L(Ci)), |B| = 
|Cij and suppose that every L E B + has a positive monotonic effect on D 
relative to C. If for some tree T on B + , uj* € and some C2, we have 

0< £>B=l,C 2 =c 2 0>*), 

(4-3) 

~ Y - D B\{L}=l,L=0,C 2 =c 2 (^*) + ^ ^B\E=l,E=0,C 2 =c 2 ( w *) I 
LeB EG'I 

then B is irreducible for £>(C,f2). 

If we know that X has a negative monotonic effect on D, then we may 
use this theorem to construct more powerful tests of the irreducibility of 
sets containing X with respect to T>(C,£l). Under the assumption that ev- 
ery L £ C has a monotonic effect on D, we have shown via direct calculation 
using cddlib [10] that for |C| < 4, the conditions in (4.3) are the sole re- 
strictions on the law of (D, C, W) implied by the negation of irreducibility. 
We conjecture that this holds in general. 

Proof. By Theorem 3.2 it is sufficient to prove that under the mono- 
tonicity assumption on B + , (4.3) implies (3.1). Suppose that (3.1) does not 
hold, so that for all values C2, and all co* £ f2, 

£>B=l,C 2 =c 2 (w*) - - D B\{L}=l,L=0,C 2 =c 2 (w*) < 0. 

LeB 

Then for each there exists R £ B + such that 

£>b=i,c 2 =c 2 (^*)- Yl D n\{L}=i,L=o,c 2 = C2 (u*)<0. 
Le&'u{R} 

For a given tree T, the remaining terms on the right-hand side of (4.3) are 

- Y - D B\{L}=l,L=0,C 2 =c 2 (^*) + Y £) B\E=1,E=0,C 2 =c 2 (^*) 
LeB + \{_R} EeX 

= Y ( jD B\4(L)=l,</ ) |(L)=0,C 2 =c 2 ( w *) - ^B\{L}=l,L=0,C 2 =c 2 (^*)), 
L€B+\{R} 
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Fig. 1. The three trees on {Xi, X 2 ,X 3 }. 

by Proposition 4.7. Finally since 4>r{L) = {L,L'} C B + , L 1 has a positive 
monotonic effect on D relative to C, thus no term in the sum is positive. 
Consequently for all u* € ft, (4.3) does not hold for any tree %. □ 

Example 2. In the case B = {A"i, X 2 } = B + = C, there is only one tree 
on B + , consisting of a single edge X± — X%. Thus if X\ and X2 have a 
positive monotonic effect on D (relative to C) then Theorem 4.11 implies 
that if the following inequality holds for some u G f2, 

D u (oj) - (D 10 (u) + A)iM) + #oo (w) > 0, 
then {Xi,X 2 } is irreducible for £>(C,f2). 

Example 3. If B = {Xi,X 2 ,X 3 } = B + = C, then there are three trees 
on B_|_; see Figure 1. These correspond to the following conditions: 

(a) L»m (a;) - (Hiio(w) + D 101 (oj) + D 011 (u)) + (D 010 {cu) + D 001 (uj)) > 0, 

(b) D u1 (oj) - (Dno(w) + D 101 (u) + Dou(w)) + (£>iooH + AxnM) > 0, 

(c) D m (w) - (Dno(w) + D 101 (lo) + ArnM) + (D 100 (u) + AnoM) > 0. 
Thus B is irreducible for T>(C,Q) if at least one holds for some wGO. 

Corollary 4.12. Let C = Ci 0C 2 , B = (B+ UB') e P(L(Ci)), |B| = 
|Ci|. Suppose that every L £ B + aas a positive monotonic effect on D rela- 
tive to C, and W is sufficient to adjust for confounding of C on D. If for 
some tree % on B + , and some c 2; w we aaue 

< £[D|B = 1,C 2 = c 2 ,W = w] 

(4.4) - ^£ , [L>|B\{L} = l,L = 0,C 2 = c 2 ,W = w] 

LeB 

+ ^£7[D|B\E = l,E = 0,C 2 = C2,W = w], 
Eex 

then B is irreducible for T>(C,£l). 
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Proof. Directly analogous to the proof of Theorem 4.3. □ 

The special case of the previous Corollary where |B + | = |C| = 2, and 
W = 0, appears in Rothman and Greenland [26]; see also Koopman [13]. 
Theorem 4.8 implies that if every literal in B has a positive monotonic effect 
on D, then we will have |B|I B I~ 2 conditions to test, each of which is sufficient 
to establish the irreducibility of B for T>(C,£l). As before, the conditions 
(4.4) may be tested via i-test type statistics or using various statistical 
models. 

As with the results in Section 4.2, we may apply Corollary 3.8 to establish 
that B is irreducible for D(C U C, O) if every variable in C is not causally 
influenced by variables in C. 

4.6. Tests for a minimal sufficient cause under monotonicity. As noted 
in Section 3.1 if |B| = |C|, then irreducible conjunctions are also minimal 
sufficient causes. Thus in this special case, the tests of irreducibility given in 
Theorem 4.3 and Corollary 4.12 also establish that B is a minimal sufficient 
cause relative to C. When |B| < |C| these tests do not in general establish 
this. However, under positive monotonicity assumptions on C2, such tests 
may be obtained by taking c 2 = 0: 

Proposition 4.13. £e£C = CiUC 2 , BeP(L(Ci)), |B| = |Ci|. Sup- 
pose every L € C2 has a positive monotonic effect on D relative to C. // (i) 
£>b=i,c 2 =o(w*) = 1 and (ii) for all LgB, -Db\{l}=i,l=o,C2=o( w *) = °> then 
B is a minimal sufficient cause for D relative to C for oj* . 

Proof. For any c 2 , Db=i,c 2 =c 2 (w*) > -Db=i,c 2 =o(^*) = 1 by the mono- 
tonicity assumption. Hence B is a sufficient cause for D relative to C for 
oj* . Minimality follows directly from condition (ii). □ 

We have the following corollaries which provide conditions under which B 
is a minimal sufficient cause for D relative to C for some oj € f2, in addition 
to being irreducible for D(C,Q): 

Corollary 4.14. LeiC = CiUC 2 , B e P(L(Ci)), |B| = |Ci|. Suppose 
every L £ C 2 has a positive monotonic effect on D relative to C, and W is 
sufficient to adjust for confounding of C on D. If (4-2) holds with c 2 = for 
some w, then B is a minimal sufficient cause of D relative to C for some 

well. 

Proof. The proof follows immediately from Proposition 4.13 and The- 
orem 4.3. □ 
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Corollary 4.15. Let C = CiUC 2 , B = (B+UB') £P(L(Ci)), |B| = 
|Ci|. Suppose that every L € B + U C 2 has a positive monotonic effect on D 
relative to C, and W is sufficient to adjust for confounding of C on D. If 
(4-4) holds with c 2 = for some w and some tree T on B + , then B is a 
minimal sufficient cause of D relative to C for some oj € ft. 

Proof. The proof follows immediately from Proposition 4.13 and Corol- 
lary 4.12. □ 

5. Singular interactions. In the genetics literature, in the context of two 
binary genetic factors, X\ and X2, "compositional" epistasis [2, 6, 19] is 
said to be present if for some oj*, Du(oj*) = 1 but D\q(oj*) = Dq\{uj*) = 
Dqq(uj*) = 0; in this case the effect of one genetic factor is effectively masked 
when the other genetic factor is absent. If {Xi,^} is a minimal sufficient 
cause of D relative to {Xi, X 2 } for a;* then although this implies Dh(lj*) = 1 
and Dio(u*) = Dq\(oj*) = 0, it does not imply Dqq[oj*) = 0. This motivates 
the following: 

Definition 5.1. A minimal sufficient cause B for D relative to C for u* 
is said to be singular if there is no B' € P(L(C)), B' 7^ B, forming a minimal 
sufficient cause for D relative to C for uj*. B is singular for T>(C,Cl) if B is 
singular relative to C for some u* € ft. 

If B is singular for T>(C, fl), then we will also refer to a singular interaction 
between the components of B. We now characterize singularity in terms of 
potential outcomes: 

Theorem 5.2. Let C = Ci UC 2 , B e P(L(Ci)), |B| = |Ci|. Then B is 
singular for T>(C, O) if and only if there exists uj* € ft such that 

(5.1) for all values c*,,h : D B=b C2=c . (cj*) = 1 b=l. 
Note that (5.1) is equivalent to 

(5.2) D c =c(u*) = (A( B )) C for a11 c - 

Thus if B is singular for T>(C,ft), then there is some individual u* whose 
potential outcomes are given by the single conjunction B. 

Proof. By definition, B is a sufficient cause for D for oj* if and only if 
(b = 1 =^ D B=b C2=c *(w*) = 1). Thus it is sufficient to show that, assuming 
B is a minimal sufficient cause for D for oj* , there are no other minimal 
sufficient causes of D for oj* if and only if ( D B=b Ca=c * (cj*) = 1 b = 1). 

Suppose B is the only minimal sufficient cause for D for oj*, but that 
for some b* / 1, £> B =b*,c 2 =c* (oj*) = 1. Let Bt = B[ B=b *> C2=c 2] . Bt forms a 
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sufficient cause for D for u*, and B ^ B^. Hence there is some B' C B 1 " that 
is a minimal sufficient cause for D for co*, and B ^ B^, a contradiction. 

Conversely suppose (^B=b,c 2 =c^(w*) = 1 b = 1) but there exists an- 
other minimal sufficient cause B' for D for oj*, and B/B^ Since B' is 
minimal, B ^ B'. Thus there exists a c such that (B)e ^ 1, but (B')c = 1 
and hence Dq = c(uj*) = 1, a contradiction. □ 

Corollary 5.3. For D(C, Cl), i/B is singular then B is irreducible. 

Proof. The proof follows immediately from (5.1) and the definition of 
irreducibility. □ 

Theorem 5.4 relates singular interactions to properties of the set of suffi- 
cient cause representations for T>(C,Cl). 

Theorem 5.4. Let B G P(L(C)). B is singular for T>(C,Cl) if and only 
if there exists u* G Cl such that in every representation (A, 05) for V(C, Cl), 
(i) for all B* G P(L(C)), with |B*| = |C| and B C B* there exists Bj <E 
OS with Bi C B* and A^uj*) = 1; (ii) for all Bj € 03 such that B Bj, 
Moj*) = 0. 

Proof. Let C = CiUC 2 , where BeP(L(Ci)), and |B| = (Cij. 

(=>) Suppose B is singular for T>(C, Cl), so that some ui* G Cl satisfies (5.2). 
Then for any representation (A, 03) for V(C,Cl) and any B* such that 
|B*| = |C| and B C B*, we can select values c* 2 so that B* = B[ B=1 > C2=C 2] . 
Since ^b=i,c 2 =c*(w*) = 1 there exists At G A, Bj G 03 with Ai{uj*) = 1 and 
(A(Bj)) B =i,c 2 =c* = 1. Thus Bj C B*, so (i) holds. For all Bj G OS such that 
B Bj, we can choose B G P(L(Ci)), |B| = |Ci| with B / B and values c 2 
so that Bj C B[ B=1 ' C2=£a ]. Since L>g =1 C2= - C2 {u*) = we have = 

since (A(Bi))g=i c 2 =c 2 = ^> so ^olds as required. 

(•<=) Suppose there exists u>* G such that every representation (A, OS) 

satisfies (i) and (ii). We will show that (5.1) holds. For any values let B* = 
B [B=i,c 2 =c^ SQ | B *| = | C | and B q B * Thug by ^ there exigts B . G 

with Bj C B* and Ai(ui*) = 1. Hence £>b=i,c 2 =cJ = 1 since Ai(uj*) = 1 
and (A(Bj)) B =i,c 2 =c* = 1- Conversely for any b' + 1, let B' = B[ B=b 'l, so 
|B'| = |Ci| with B' / B. Thus for all Bj G OS such that (A(Bj)) B '=i,c 2 =c| = 
1, B g Bj and thus by (ii) A^cj*) = 0. Hence £>b'=i,c 2 =c|(^*) =0. □ 

We now consider results relevant for testing for singular interactions with 
or without monotonicity assumptions. 

Theorem 5.5. Let B = B + UB' G P(L(C)), |B| = |C| and suppose that 
every L G B + has a positive monotonic effect on D relative to C. If for some 
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tree T on B + and some w*£fi, we have 

-Db=i(^*)- ^ £'b\{l}=i,l=o(w*) 

LGB+ 

(5.3) 

E £> B\B=l,B=o( W *) + E D B\E=1 >E =0(^*) > 0, 
B:0^BCB' Ee ' A 

i/ien B is singular for X>(C,0). 

Proof. By Theorem 5.4, B is singular for X>(C,fi) if and only if 

(5.4) for some € O, £> B =i(w*) - E 5 b\b=i,b=o( w *) > °- 

BCB 

Suppose for a contradiction that (5.4) does not hold but (5.3) holds for 
some lo* G fi. Since B + has a positive monotonic effect on D relative to 
C, if B C B is such that B n B + / 0, then D BX ^ =1 g =0 (u;*) = 1 implies 

-C ) B\{L}=i,L=o( a; *) = 1 f° r some L G B. Hence for all wed, 

(5.5) £> B =lM " ^ OB\{i}=l,L=0(w) - E ^B\B=l,B=oH ^ °" 

L eB BCB',|B|>2 

By applying the same argument to the first two terms on the left-hand side 
of (5.5) as was applied in the proof of Theorem 4.11, we have that (5.3) does 
not hold for all co G fi, which is a contradiction. □ 

The following corollary to Theorem 5.5 generalizes the discussion in [32, 
33] to an arbitrary number of dichotomous factors: 

Corollary 5.6. Let B = B + UB' G P(L(C)), |B| = |C|. Suppose that 
every L G B + has a positive monotonic effect on D relative to B, and W is 
sufficient to adjust for confounding of C on D. If for some tree % on B + , 
and some w, we have 

0<£[L>|B = l,W = w] - E[D\B\{L} = 1,L = 0,W = w] 

LeB + 

(5.6) £[£>|B\B = l,B = 0,W = w] 

B: 0^BCB' 

+ Y E[D\B \ E = 1, E = 0, W = w], 

Eel 

then B is singular for T>(C,Q). 
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Proof. By applying Proposition 4.2 to each term in (5.3), the proof is 
complete. □ 

Condition (5.6) leads directly to a statistical test of compositional epis- 
tasis. This is notable since some claims in the genetics literature appear to 
suggest that such tests did not exist [6] . 

As stated in the next corollary, from Theorem 5.5, if all or all but one 
of the elements of B have positive monotonic effects on D, then singularity 
and irreducibility coincide: 

Corollary 5.7. Suppose |B| = |C| and that for all or all but one of 
Bi € B ; Bi has a positive monotonic effect on D relative to B, then B is 
singular for T>(C, O) if and only if B is irreducible for T>(C, £1). 

An important consequence of this corollary is that when there is at most 
one variable that does not have a positive monotonic effect, condition (4.4) 
establishes that B is singular in addition to being irreducible for V(C,Q). 

Proof. Let B' denote the one or zero elements of B that do not have 
a monotonic effect on D relative to C. If B is irreducible for P(C,0), then 
by the argument in the proof of Theorem 4.11, 

£>B=lK) " Yl A}\{L}=1,L= (W*) + Ai\E=l,E=o(w*) > 0. 
LeB EgX 

Since the third term on the left-hand side of (5.3) vanishes when |B'| < 1, it 
follows that B is singular for T>(C, O). The converse is given in Corollary 5.3. 

□ 

Corollary 5.8. Suppose |B| = |C| and that for all or all but one of 
Bi € BuC, Bi has a positive monotonic effect on D relative to B U C , 
for all X' € C , X' is not causally influenced by C and B is singular for 
V(C,n), then B is singular for D(CUC', fi). 

Proof. By Corollary 5.7, B is irreducible relative to T>(C,£l). Hence 
by Corollary 3.8 B is irreducible relative to T>(C U C, Q). The conclusion 
then follows from a further application of Corollary 5.7. □ 

5.1. Relation to Pearl's probability of causation. Pearl [18], Chapter 9, 
defined the probability of necessity and sufficiency (PNS) of cause X for 
outcome D to be P(D x =\{ui) = 1,D x =o(oj) = 0). In other words PNS(D, A") 
is the probability that D would occur if X occurred and would not have done 
so had X not occurred. We generalize this to the setting in which there are 
multiple causes B: 
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Definition 5.9. For B C P(L(C)), the probability of necessity and suf- 
ficiency o/B causing D is 

PNS(L>, B) = P(D B=1 = 1 and for all b + 1, D B=h = 0). 

Thus PNS(-D,B) is the probability that D would occur if every literal 
L € B occurred and would not have done so had at least one literal in B not 
occurred. 

Proposition 5.10. J/|B| = |C|, then PNS(D,B) > if and only if & 
is singular for T>(C, Q) . 

Proof. The proof follows directly from Theorem 5.4 and Definition 5.9. 

□ 

This connection also provides an interpretation for condition (5.6). For 
expositional convenience in the following proposition, we assume that B and 
D are unconfounded and do not make monotonicity assumptions; it would 
be straightforward to do so. 

Proposition 5.11. Under the conditions of Corollary 5.6, with W = 
= B + , PNS(B,D) is bounded below by the right-hand side of (5.6). 

Proof. 

PNS(£>, B) = P(D B=1 = 1 and for all b ^ l,D B=h = 0) 

> P(D B =i = 1) + P(for all b ^ 1, D B=h = 0) - 1 
= P(D B=1 = 1) - P(for some b ^ 1, D B=b = 1) 

> P(D B=1 = 1) - P ( D B=b = 1) 

b^i 

= E[D = 1|B = 1] - E[D\B = b] 
b^i 

which is the right-hand side of (5.6) with W = = B + . □ 

This generalizes some of the lower bounds on PNS(D, X) given by Pearl [18] , 
Section 9.2. 

6. Relation to statistical models with linear links. In related work [39] it 
is noted that the presence of interaction terms in statistical models do not, 
in general, correspond to sufficient conditions for irreducibility. Consider, 
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for example, a saturated Bernoulli regression model for D with identity link 
and binary regressors C = {Xi, . . . , X p }, 

(6.1) E[D\C = c}= E^b(A( § )) c " 

BCC 

Note that with c = (x\, . . . ,x p ), then (/\(B)) C = Ilx gb x *' ^ e usua l product 
interaction term. The conditions, given earlier, for detecting the presence of 
irreducibility and singularity lead to linear restrictions on the regression 
coefficients /3g. 

(6.2) £ m § /3g > 0. 

BCC 

Note that (6.2) includes an intercept /3 . First we define 

deg x (L) = |{E|EeX,LeE}|, 
the degree of L in a tree X, to be the number of edges in X that contain L. 

Proposition 6.1. Under the conditions of Theorem 4. 11, with B = C, 
condition (4-3) is equivalent to restriction (6.2) with mg =m~ red where 

m£ red = l-|B\B| + |X| 

(6.3) 

- deg I (L) + |{E|E€X,ECBnB+}|. 

LgBnB + 

Note that since X is a tree on B + , the last term in (6.3) has a natural 
graphical interpretation as the number of edges in the "induced subgraph" 
of X on the subset B. Definition (6.3) also subsumes condition (4.2) given 
in Theorem 4.3 (without monotonicity) , in which case the last three terms 
in (6.3) vanish. Though Proposition 6.1 assumes that C2 = 0, the condition 
given by (6.2) and (6.3) continues to apply in the case where C2 = 0, as 
in Corollaries 4.14 and 4.15; obvious extensions apply to the case where 
c 2 ^0. 

Proof. This follows by counting the number (and sign) of expectations 
in (4.3) for which B is a subset of the variables assigned the value 1 in the 
conditioning event. The first two terms in (4.3) correspond, respectively, to 
the first two terms in (6.3). The remaining three terms in (6.3) correspond 
to the last sum in (4.3): |X|, the number of edges in X, is the total num- 
ber of terms in the sum.^The sum over degrees subtracts the number of 
terms in which some L G B is assigned zero. Since this double counts terms 
corresponding to edges contained in B, the last term corrects for this. □ 
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Proposition 6.2. Under the conditions of Theorem 5.5, with B = C, 
condition (5.3) is equivalent to restriction (6.2) with mg = m~ ng where 

(6.4) m s -! ng = ml Tcd + (|B' \ Bj) - (2l B '\ § l - 1). 

Proof. Expression (6.4) follows from another counting argument simi- 
lar to the proof of Proposition 6.1, together with the observation that con- 
ditions (4.3) and (5.3) only differ in that the terms in the sum over L in 
(4.3) for LsB' are replaced by a sum over all subsets of B'. □ 

Example 4 (Two-way interactions). Consider the saturated Bernoulli 
regression with identity link with C = {X\,X2}. 

E[D\Xi =xi,X 2 = x 2 ] = /3 + /3ixi + P2X2 + Pi 2 xix 2 . 

Suppose that X\ and X2 are unconfounded with respect to D, so (4.1) holds 
with W = 0. Proposition 6.1 implies that {Xi,X 2 } is irreducible relative 
to C if P12 > f3 ; Proposition 6.2 implies that {X±,X 2 } is singular relative 
to C if P12 > 2/3 . If one of X\ or X2 have positive monotonic effects on D 
relative to C, then Proposition 6.1 and Corollary 5.7 imply that {X\,X2} 
is both irreducible and singular relative to C if f3\2 > ft®. If X\ and X2 have 
positive monotonic effects on D relative to C, then Proposition 6.1 and 
Corollary 5.7 imply that {X±,X 2 } is both irreducible and singular relative 
to C if /fo > 0. 

Thus only under the assumption of positive monotonic effects for both 
X\ and X2 does the sufficient condition for the irreducibility and singularity 
of {Xi,X 2 } coincide with the classical two-way interaction term /3i 2 being 
positive. Note that under the assumption of negative monotonic effects of 
X\ and X2 on D, fi\ 2 < is equivalent to irreducibility and singularity for 
D = (1 — D); see [36] for this and other remarks on recoding of exposures or 
outcomes. 

It also follows from Proposition 3.4 that if {X±,X 2 } is irreducible relative 
to C, then there exists some oj £ £1 for whom {X±,X 2 } is a minimal sufficient 
cause relative to C (since |B| = |C|). 

Example 5 (Three-way interactions). The saturated Bernoulli regres- 
sion with three binary variables and a identity link can be written as 

E[D = l\X 1 =x 1 ,X 2 = x 2 ,X 3 = x 3 ] 

= /3 + frxi + P2X2 + + fi\2X\X2 + fivsx\x 3 

+ /523^2^3 + fi\2-$X\X2X2. 

Suppose that C = \X\,X 2 ,X-$\ is unconfounded for D. Proposition 6.1 im- 
plies that {Xi,X2,X 3 } is irreducible relative to C if 

(6.5) /3i23>2/? +/?i+/3 2 + /? 3 . 
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It follows from Proposition 3.4 that if {Xi,X2,X^} is irreducible relative 
to C, then there exists some w£0 for whom {Ai,A2, A3} is a minimal 
sufficient cause relative to C (since |B| = |C|). 

Proposition 6.2 implies {Ai, A2, A3} is singular relative to C if 

0i23>60 +20i+20 2 + 20 3 . 

However, if X\, X<i and A3 have positive monotonic effects on D (relative 
to C), then Proposition 6.1 implies {Ai, A2, A3} is irreducible relative to C 
if any of the following hold: 

(6.6) 0123 > ft, 0123 > 02, /3i23>/3 3 ; 

equivalently, 0123 > min{0i, 02, /3a}. By Corollary 5.7 this also establishes 
that {Ai, A2, A3} is singular relative to C. 

If only Ai and X2 have positive monotonic effects on D relative to C, 
then Proposition 6.1 implies that {Ai, A2, A3} is irreducible relative to C if 

(6.7) 0123 > /?0 + A+ /3 2 - 

By Corollary 5.7, condition (6.7) also implies that {Xi,X2,Xz} is singular 
relative to C (since only A3 does not have a positive monotonic effect on 
D). As we would expect condition (6.7) is weaker than (6.5), but stronger 
than any of the conditions in (6.6). If only one potential cause has a mono- 
tonic effect on D relative to C, then we can only use (6.5) to establish 
irreducibility. 

Thus for three-way interactions, 0123 > does not correspond to any of the 
sufficient conditions for irreducibility or singularity of {X\, A2, A3} relative 
to C, regardless of whether or not monotonicity assumptions hold. 

7. Interpretation of sufficient cause models. As mentioned in Section 2.1 
the observed data for an individual (C(uj),D(uj)) represents a strict subset of 
the potential outcomes T>(C,uj); this is the "fundamental problem of causal 
inference." Further, as we have seen, for a given set of potential outcomes 
there can exist different determinative sets of minimal sufficient causes 53 
for the same set of potential outcomes; see (2.3) and (2.4). Thus we have 
the following for an individual: 

: \ i \ 

(71) 53 -> V(C,lo) -> (C(u),D(u)). 

many-one many-one 

It is typically impossible to know the set of potential outcomes for an in- 
dividual T>(C,uj), even when C = {A}, even from randomized experiments. 
However, possession of this knowledge would permit one to predict how a 
given individual would respond under any given pattern of exposures C = c. 
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The results in this paper demonstrate that, given data from a random- 
ized experiment (or when sufficient variables are measured to adjust for 
confounding), it is possible to infer the existence of an individual for whom 
all sets of minimal sufficient causes 03 have certain features in common. How- 
ever, given the double many-one relationship (7.1), and the fact that the set 
of potential outcomes T>(C,u>), if they were known, apparently address all 
potential counterfactual queries, it is natural to ask what is to be gained by 
considering such representations. We now motivate our results by presenting 
several different interpretations of sufficient cause representations. 

7.1. The descriptive interpretation. Under this view, sets of minimal suf- 
ficient causes are merely a way to describe the set of potential outcomes 
T>(C,Q). The representation (A, 55) may be more compact; compare Ta- 
ble 2 and (2.5). Extending this to a population f2, the variables A in a 
representation (A, 25) merely describe subpopulations with particular pat- 
terns of potential outcomes. 

Knowing that there exists an individual for whom all representations *B 
have certain features in common provides qualitative information about the 
set of potential outcomes. 

For two binary causes {X\,X2}, Theorem 3.2 implies that {Xi,^} is 
irreducible relative to C for u* if D u (uj*) = 1 and Di (u)*) = D i(oj*) = 0. 
Such a pattern is of interest insofar as it indicates that the causal process 
resulting in this individual's potential outcomes T>(C,oj*) is such that (for 
some setting of the variables in C \ {X±,X2}), D = 1 if both X\ = 1 and 
X2 = 1 , but not when X\ = 1 and X2 = or vice versa. 

Similarly it follows from Theorem 5.2 that if {X\,X2} is singular relative 
to C for uj* then Dn(u*) = 1 and D 10 (u*) = D 01 (u*) = D 00 (u*) = 0. Hence 
the causal process producing T>(C,u)*) is such that, for some setting of the 
variables in C \ {X\,X2}, D = 1 if both X\ = 1 and X2 = 1, but not when 
either l 1 = 0orl 2 = 0. 

In contrast to the classical notions of interaction arising in linear models 
(see Section 6), irreducibility and singularity are causal in that they relate 
to the potential outcomes. Sections 4 and 5 contain empirical tests for the 
presence of irreducible or singular interactions. 

7.2. Generative mechanism interpretations. A minimal sufficient cause 
representation may be interpreted in terms of a "generative mechanism" : 

Definition 7.1. A mechanism M(uj) relative to C takes as input an 
assignment c to C, and outputs a "state" M c (uj) which is either "active" (1) 
or "inactive" (0). A mechanism is said to be generative for D if whenever it 
is active, the event D = 1 is caused, so that M c (uj) = 1 implies D c (u) = 1. 
Conversely, a mechanism is said to be preventive for D if whenever M c (u) = 
1, D c (oS) = is caused. 
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Though this definition refers to a mechanism "causing" D = 1 or D = 0, 
we abstain from defining this more formally in terms of potential outcomes 
until the next section. Our reason for proceeding in this way is that there 
may be circumstances in which an investigator is able to posit the existence 
of a causal mechanism causing D = 1 or D = 0, for example, based on ex- 
periments manipulating the inputs C and output D, but lacks sufficiently 
detailed information to posit well-defined counterfactual outcomes involving 
interventions on these (hypothesized) mechanisms. 

Definition 7.2. A set of generative mechanisms M = (M 1 ,...,M P ) 
will be said to be exhaustive for a given set of potential outcomes Z?(C,f2) 
if for all well, and all c, if D c (u) = 1, then for some M l G M, M*(w) = 1. 

Note that if M forms an exhaustive set of mechanisms for Z>(C,f2), then 
it follows that in a context in which no mechanism M l is active, D = 0. 

Proposition 7.3. I/M = (M 1 , . . . , M p ) forms an exhaustive set of gen- 
erative mechanisms for 2?(C,f2), then D = \J(NL) and D c (uj) = \/(M c (w)). 

Proof. The proof follows from Definitions 7.1 and 7.2. □ 

Proposition 7.4. Suppose M forms an exhaustive set of generative 
mechanisms forT>(C,£l). 7/B G P(L(C)), |B| = |C| and B is irreducible for 
V(C;fl), then there exists an individual uj* and a mechanism M l such that 
M^ =1 (uj*) = 1 but for all L G B, M* i L y =1L=0 (w*) = 0. 

Thus if there exists an exhaustive set of generative mechanisms for £>(C, fi) 
and B is irreducible, then there is an individual u* and a mechanism M l 
such that M % is active when all the literals in B take the value 1, and is 
inactive when any one literal is 0, and the rest continue to take the value 1. 

Proof. By Theorem 3.2, since B is irreducible for T>(C; O), there exists 
w'el! such that Z?b=i(<^*) = 1 and for all L G B, -C ) B\{L}=i,L=o( a; *)- Since 
M is an exhaustive set of generative mechanisms for T>(C,D,), we have that 
for all c, D c (uj*) = V(M c (w)*). Since D b=1 (oj*) = 1, for some Mi G M, 
M^ =1 (uj*) = 1. Since for all L G B, D B \{ L y =1L=Q (uj*) = we have that 

M B\ { L}=l,L=o(^)=0- □ 

Proposition 7.5. Suppose M forms an exhaustive set of generative 
mechanisms for X>(C,f2). If B G P(L(C)), |B| = |C| and B is singular for 
T>(C;Q), then there exists an individual uj* and a mechanism M ! such that 
Mg =b (w*) = 1 if and only ifb = 1. 
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Hence under the conditions of Proposition 7.5, if B is singular, then there 
is an individual uj* and a mechanism M % such that M % is active if and only 
if all the literals in B take the value 1. 

Proof. The proof is similar to the proof of Proposition 7.4, replacing 
Theorem 3.2 by Theorem 5.2. □ 

As the next example shows, the assumption that there exists an exhaustive 
set of generative mechanisms is substantive, and does not hold in all cases. 

Example 6. Suppose C = {X±, X 2 } where X\ and X 2 denote the pres- 
ence of a variant allele at two particular loci. Let M 1 and M 2 denote two 
different proteins such that M l is produced if and only if Xi = 0, that is, 
the associated allele is not present. Finally, let D denote some characteristic 
whose occurrence is blocked by the presence of either M 1 or M 2 (or both). 
In this example, 

M XlX2 (u) = (l-x i ), 

D XlX2 {u) = (1 - Ml 1X2 ) V (1 - M 2 X1X2 ) = Xl x 2 . 

By De Morgan's law, the second equation here may also be expressed as 

1 - D XiX2 (uj) = M^ iX2 (u)M 2 iX2 (uj) = 1 - Xl x 2 . 

The mechanisms M 1 and M 2 are preventive for D, so that D = 1 only 
occurs when both mechanisms are inactive. An exhaustive set of generative 
mechanisms does not exist because in this example there are no generative 
mechanisms (all mechanisms are preventive). 

It is natural to suppose that mechanisms are "modular" and thus may be 
isolated or rendered inactive without affecting other such mechanisms. This 
may be formalized via potential outcomes: 

Definition 7.6. An exhaustive set of generative mechanisms M for 
T>(C,Q) are said to support counterfactuals if there exist well-defined po- 
tential outcomes Dc=c,M=m(w) and ^M=m(^) such that 

^C=c,M=m(w)=fl M =m(w) = (\J ( M )) m 

The important assumption here is the existence of the potential outcomes 
D m (w) and D C)Xn (uo). Note that if M supports counterfactuals then inter- 
ventions on C do not affect D if interventions are also made on M. 

Proposition 7.7. If the exhaustive set of generative mechanisms M 
support counterfactuals, then 

D M=M(uj)( uj ) = A^=C(w),M=M(w)( w ) = D{uj) 

so that consistency holds for the potential outcomes D m (co) and D Cim (w). 
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Proof. This follows because 

£ ) C=C( W ),M=M( w )(w) = 5m=M( w )(w) = \/( M ( w )) = ^t")' □ 

7.3. Counter factual interpretation of a sufficient cause representation. If 
we have an exhaustive set of generative mechanisms which supports counter- 
factuals, and further each mechanism is a conjunction of literals, then there 
will be a sufficient cause representation that itself supports counterfactuals. 

Definition 7.8. A representation (A, OS) for T>(C,Q) will be said to be 
structural if for each pair (Aj,Bj), A, Bj€ 03 there exists a generative 
mechanism (or mechanisms) M l such that M l = Ai A (/\(Bj)) and 

M i c =cH=^(^)A(/\(B 4 ))c. 

Thus if (A, 03) is structural, then each pair (^4j,Bj), A^ A, Bj€ 03 cor- 
responds to a mechanism M l . Thus in this case the variables Ai(u) may 
be interpreted as indicating whether the corresponding mechanism(s) M l is 
"present" in individual uj. We may thus associate potential outcomes with 
the Ai, corresponding to removing (or inserting) the corresponding mech- 
anism^). This interpretation of the Ai's is consistent with the notion of 
"co-cause" which arises in the literature on minimal sufficient causes. 

We note that "structural" is often used as a synonym for "causal." How- 
ever, even under the weak interpretation, a sufficient cause representation 
is causal in that it represents a set of potential outcomes. The word is used 
in Definition 7.8 to connote that the structure of the representation itself 
represents (additional) potential outcomes for a set of mechanisms M that 
correspond with the pairs (Aj,Bj), A^ A, Bj€ 23. Note that there need not 
be a unique structural representation (A, 03) for V(C,ft). There might be 
several functionally equivalent, yet substantively different, generative mech- 
anisms corresponding to a given pair (Aj,Bj); see Example 6. 

Proposition 7.9. If a representation (A, 03) for V(C,Q), where A = 
(Ax, . . . ,A p ), is structural, then the associated set of generative mechanisms 
(M 1 , . . . , M p ) is exhaustive. 

PROOF. This follows from Definitions 2.7 and 7.2. □ 

Proposition 7.10. Suppose that M forms an exhaustive set of gen- 
erative mechanisms for D(C,Q), and M supports counterfactuals. If for 
all M l € M there exists B, € P(L(C)), and an Ai such that for all c, and 
uett, if Ai(oj) = 1, then (M*) c (w) = /\( B i)c then (A = (A x , . . . , A p ), 03 = 
(Bi, . . . , Bp)) forms a representation for T>(C, f2) that is structural. 

Proof. This follows from Definitions 2.7 and 7.8. □ 
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Proposition 7.11. If there is some representation (A, 05) that is struc- 
tural, and B € P(L(C)) is irreducible for T>(C; Q), then there exists a mech- 
anism Mi that is active only if B = 1 . 

Proof. If B is irreducible for £>(C;0), then there exists Bj € OS with 
BCB;; the mechanism M i = A { A (A( B *)) is sucn that M { = \ only if B = 1. 

□ 

Note that the conclusion of Proposition 7.11, unlike those of Proposi- 
tions 7.4 and 7.5, does not make reference to an individual co* . This is 
because Proposition 7.11 assumes that there is a representation (A, OS) that 
is structural: in this representation the Ai variables may be seen as a con- 
stituent part of the corresponding mechanism Mj. 

Note that there may exist a set of exhaustive generative mechanisms, but 
these mechanisms may not themselves be conjunction of literals so that there 
is no sufficient cause representation for T>(C;Q) that is structural: 

Example 7. Suppose C = {Xi,X2} where X\ and X 2 again denote the 
presence of variant alleles, acquired by maternal and paternal inheritance, 
respectively, at a particular locus. Let M denote a protein that is produced 
if and only if either X\ = X2 = 1 or X\ = X2 = and let D denote some 
characteristic that occurs if and only if M = 1. Suppose we can intervene to 
remove or add the protein. We then have that 

M XiX2 (lo) = xix 2 V (1 - xi)(l - x 2 ), 

D XiX2 (uj) = M XiX2 (uj), 

D XlX2m (uj) = D m (uj) = m. 

Thus {M} constitutes an exhaustive set of generative mechanisms for T>(C, ft) 
We have the following representation for D(C,f2): 

D XlX2 (u) = (A 1 (oj)X 1 X 2 V A 2 (u)(l - - X 2 )) X1X2 

with Ai(uj) = A2(uj) = 1 for all w£fl. However, this representation is not 
structural because Ai(u)XiX2 and A2(uj)(1 — ^i)(l — X2) do not consti- 
tute separate mechanisms for which interventions are conceivable; there is 
only one mechanism M, the protein. Since for any Du(w) = 1 and 

-Dio(w) = Doi(u) = 0, {X±,X2} is irreducible relative to C; however it is not 
the case that there is a mechanism Mi that is active only if X1X2 = 1 since 
for the only mechanism M it is the case that M = 1 if X\ = X2 = 0. Note, 
however, in this example there is still a mechanism, namely M, that will 
be "active" if X\ = X2 = 1 but will be "inactive" if only one of X\ or X2 
is 1. 
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Example 8. To illustrate the results in the paper we consider again the 
data presented in Table 1. We let D denote bladder cancer, X\ smoking, 
X 2 the S NAT2 genotype and X 3 the *10 allele on NATl. As discussed in 
Example 5, if the effect of C = {X±, X 2 ,X 3 } is unconfounded for D and we 
fit the model 

E[D = l\X 1 =x 1 ,X 2 = x 2 ,X 3 = x 3 ] 
(7.2) = fie, + fiixi + (3 2 x 2 + (3 3 x 3 + 0i 2 xix 2 

+ Pl3XlX 3 + (3 23 X 2 X 3 + f3 123 X!X 2 X 2 , 

then if Xx, X 2 and X 3 have positive monotonic effects on D (relative to C), 
then {Xi,X 2 ,X 3 } is irreducible relative to C if any of the following hold: 

0123 > 01 . /?123 > /?2 , 0123 > 03 • 

We cannot fit model (7.2) directly with case control data. However, under 
the assumption that the outcome is rare (reasonable with bladder cancer) 
so that odds ratios approximate risk ratios, we can fit the model 

E[D = 1\X ± = x u X 2 = x 2 ,X 3 = x 3 ]/E[D = 1\X 1 = 0, X 2 = 0,X 3 = 0] 

(7.3) 

= Oixi + 6» 2 x 2 + 3 x 3 + 6i 2 xix 2 + 6*13x1x3 + 6*23x2x3 + 6*123x1x2x2, 

and the conditions for the irreducibility of {Xi,X 2 ,X 3 } relative to C under 
monotonicity of {X\,X 2 ,X 3 } become 

#123 > 01, #123 > #2, #123 > #3- 

If we fit model (7.3) using maximum likelihood, we find that 

6*123 - #1 = 1.21 (95% CI: - 3.83, 6.26), 

6*123 - #2 = 2.93 (95% CI: - 2.85, 8.72), 

6*123 - #3 = 2.97 (95%CI: - 2.80, 8.74). 

In each case, under our assumption of no confounding, the point estimate 
suggests evidence of irreducibility, under monotonicity of {Xi,X 2 ,X 3 }, but 
the sample size is not sufficiently large to draw this conclusion confidently. 
With monotonicity of {Xi,X 2 ,X 3 }, irreducibility also implies a singular 
interaction for {X\,X 2 , X 3 }. If we assume that only {X\, X 2 } or {X\, X 3 } or 
{X 2 ,X 3 } are monotonic relative to C, then the conditions for irreducibility 
in Example 5 can be expressed, respectively, as 

#123 > 1 + #1 + #2, #123 > 1 + #1 + #3) #123 > 1 + #2 + #3- 

From model (7.3) we have that 

#123 - (1 + #1 + #2) = 0.09 (95% CI: - 4.77, 4.96), 
#123 - (1 + #1 + #3) = 0.13 (95% CI: - 4.69, 4.95), 
#123 - (1 + #2 + #3) = 1-86 (95% CI: - 3.41, 7.12). 
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Again, under our assumption of no confounding, in each case the point 
estimate suggests evidence of irreducibility, under monotonicity of just two 
of the three exposures, but the sample size is not sufficiently large to draw 
this conclusion confidently. With monotonicity of two of the three exposures, 
irreducibility also implies a singular interaction for {X\,X2,X3}. The test 
for irreducibility in Example 5 without assumptions about monotonicity can 
be expressed as 

$123 > 2 + #i + 62 + #3- 

From model (7.3) we have that 

#123 - (2 + 0i + 02 + #3) = -0.99 (95% CI: - 5.86, 3.88). 

In this case, not even the point estimate is positive. 

If {Xi,X2,Xs} is in fact irreducible and if there exists a representation 
that is structural, then it follows by Proposition 7.11 that there exists a 
mechanism that is active only if X\ = 1, X2 = 1, A3 = 1. 

8. Concluding remarks. In this paper we have developed general results 
for notions of interaction that we referred to as "irreducibility" (aka "a 
sufficient cause interaction") and "singularity" (aka "a singular interaction") 
for sufficient cause models with an arbitrary number of dichotomous causes. 
The theory could be extended by developing notions of sufficient cause, 
irreducibility and singularity for causes and outcomes that are categorical 
and/or ordinal in nature; see [34]. 
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